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SECTION  1 


INTRODUCTION 


Over  the  last  two  years  there  has  been  a  growing  interest  in  a  Bayesian  ap¬ 
proach  to  the  problem  of  testing  compliance  to  the  threshold  test  ban  treaty  (TTBT). 
(See  Nicholson  et  al.  [1991]  and  Shumway  and  Der  [1990].)  During  that  same  time  it 
has  been  pointed  out  that  previous  data  does  furnish  some  information  about  the  un¬ 
known  parameters  in  the  yield  estimation  problem  even  though  the  associated  yields 
are  unknown.  For  example,  since  for  equal  slope  parameters,  m4(P)  —  mb(Lg)  does 
not  depend  on  yield,  one  can  estimate  the  mean  and  variance  of  this  difference  from 
data  for  which  the  yields  are  unknown.  Typically,  the  size  of  such  data  sets  is  large 
enough  to  provide  excellent  estimates  of  these  parameters.  This  information  can  then 
be  used  to  improve  estimates  of  future  yields  (See  Nicholson  et  al.  [1991]). 

Our  approach  is  a  modification  of  the  approach  suggested  by  Shumway  and 
Der  [1990].  They  have  shown  that  prior  information,  supplied  for  example  by  a  panel  of 
experts,  can  be  useful  for  improving  confidence  intervals  on  the  log  yield,  particularly 
when  there  is  little  calibration  data.  Our  approach  also  allows  expert  opinion  to 
be  used  as  prior  information.  In  addition,  we  include  the  information  gained  from 
previous  data  for  which  the  yields  are  unknown,  in  terms  of  constraints  on  the  mean 
and  variance  of  the  difference  of  the  seismic  magnitudes. 

In  this  report  we  investigate  the  effect  of  such  information  on  a  statistical  test 
of  TTBT  compliance  by  comparing  the  power  and  F-number  of  the  test  based  on  two 
Bayesian  approaches,  with  and  without  the  constraints.  We  develop  a  Bayesian  for¬ 
mulation,  treating  the  unknown  intercepts  and  covariance  matrix  as  random  variables 
for  which  a  joint  Bayesian  prior  may  be  specified,  with  and  without  the  constraints  on 
the  prior.  The  case  in  which  the  slopes  are  also  treated  as  unknown  will  be  discussed 
in  a  subsequent  report. 

For  comparison,  we  consider  a  special  case  in  which  the  covariance  matrix 
is  treated  as  known,  without  uncertainty,  so  that  expressions  for  the  power  and  F- 
numbers  may  be  computed  analytically.  (This  assumption  is  also  made  by  Nicholson 
et  al.  [1991]  in  their  analysis.)  A  parametric  study  is  performed  to  assess  how  the 
probabilities  of  detecting  violations  depend  on  the  true  calibration  parameters  and 
the  number  of  calibration  events  for  which  the  yields  are  known.  The  results  show 
that  when  the  assumptions  of  the  Bayesian  priors  (constrained  and  unconstrained)  are 
correct,  the  test  based  on  the  constrained  Bayesian  approach  has  a  greater  than  or 
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equal  probability  of  detecting  a  violation  (referred  to  as  the  power  of  the  test),  at  the 
same  significance  level,  than  the  test  without  the  constraints  for  all  cases  examined. 
The  power  for  the  constrained  case  is  particularly  greater  than  the  power  for  the 
unconstrained  case  when  little  calibration  data  is  available  and  there  is  considerable 
uncertainty  in  one  of  the  intercepts. 

We  also  examine  the  robustness  of  the  two  tests  when  the  Bayesian  assump¬ 
tions  are  incorrect.  That  is,  we  compare  the  sensitivity  of  the  power  functions  and 
false  alarm  rates  of  the  two  tests  for  monitoring  a  given  test  site  when  the  a  priori 
assumptions  are  incorrect,  e.g.,  when  the  true  intercept  parameters  are  out  in  the  tails 
of  the  joint  prior  distribution.  Analytic  expressions  are  also  given  for  these  probabili¬ 
ties.  A  discussion  of  the  interpretation  of  the  Bayesian  tests  for  monitoring  given  test 
sites  is  included. 

The  Bayesian  tests  are  also  compared  to  a  test  of  hypothesis  for  which  all  of 
the  calibration  parameters  are  treated  as  known,  and  one  for  which  only  the  intercepts 
are  unknown  and  estimated  strictly  from  data.  The  former  test  provides  a  reference  by 
which  the  other  tests  may  be  judged.  That  is,  since  all  of  the  calibration  parameters 
are  known,  this  test  will  have  the  greatest  power  for  fixed  significance  level.  The 
latter  test  was  considered  by  Hafemeister  [1987].  Comparison  with  this  test  allows  us 
to  gauge  the  usefulness  of  the  a  priori  information,  including  that  provided  by  data 
for  which  the  yields  are  unknown. 

The  remainder  of  this  report  is  organized  as  follows.  In  Section  2  we  define 
the  notation  of  our  model.  In  Section  3  we  develop  the  Bayesian  approach,  with  and 
without  the  constraints,  and  in  Section  4  we  establish  a  test  of  compliance  based  on 
these  approaches  and  define  the  power  and  the  F-number  of  the  test.  In  Section  5  we 
present  the  results  of  a  power  comparison  between  the  constrained  and  unconstrained 
approaches  when  the  appropriate  a  priori  assumptions  hold,  and  in  Section  6  we 
address  the  issue  of  robustness.  Finally,  in  Section  7  we  draw  some  conclusions  from 
our  study  concerning  the  usefulness  of  our  approach  for  monitoring  specific  test  sites, 
and  describe  future  efforts  in  this  area. 
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SECTION  2 


NOTATION 


Suppose  that  there  are  two  seismic  magnitudes  observed  for  each  event,  e.g., 
mb(P)  and  mb(Lg).  Let 

rriij  =  i-th  seismic  magnitude  of  the  j-th  event 
Wj  =  log  yield  of  the  j-th  event. 

Then  for  constants  Ai  and  B,,  we  assume  the  magnitudes  and  log  yield  are  related  by 
TTiij  —  Ai  "4  B{W j  -h  6|j ,  i  2,  j  1, . . . ,  n 

=  o 

(i) 

Var  etj  =  a], 

Cov(ei„e2y)  =  p{t\,ti)ouoi2 

We  now  consider  the  problem  of  estimating  yield  and  establishing  a  compli¬ 
ance  test  for  future  events  under  the  following  scenario: 

The  Ai  and  the  covariance  matrix  of  £*,  are  unknown,  but  sufficient  infor¬ 
mation  is  available  to  define  a  joint  Bayesian  prior  for  these  quantities.  There  are  n 
calibration  events  for  which  the  yields  are  known,  and  the  n  +  1  yield  is  to  be  esti¬ 
mated.  We  also  assume  the  slopes  are  known,  and  hence  w.l.o.g.  set  Bx  —  B2  —  1. 
(The  generalization  to  other  known  values  of  the  slopes  is  straightforward.) 

We  will  also  exploit  the  information  contained  in  previous  magnitude  data 
for  which  the  associated  yields  are  unknown.  We  will  hereafter  refer  to  these  data  sets 
as  “no-yield  data.”  Let 


E[mij  -  m2j]  =  n 

(2) 

Var{mij  -  mj,)  =  A2. 

(3) 
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Under  the  assumption  that  2?i  =  Bj  =  1,  these  quantities  are  independent  of  the 
yield.  For  other  values  of  Bit  Z?2,  the  appropriate  yield-independent  combination  of  the 
magnitudes  is  .  Typically,  large  no-yield  data  sets  are  available 

which  may  be  used  to  provide  excellent  estimates  of  p  and  A*.  We  will  assume  for 
now  that  such  data  sets  are  sufficiently  large  that  the  estimated  values  of  p  and  A* 
may  be  considered  to  be  the  actual  values. 

We  now  define  random  variables  corresponding  to  the  unknown  A ,  and  the 
covariance  matrix  of  e,; : 

{°i>  ai}  =  joint  normal  random  variates,  for  which  we  assume  the  means,  pai 
and  paj,  variances,  a\x  and  and  correlation,  pa,  are  given  a  pri¬ 
ori.  The  means  are  taken  as  the  “most  likely”  values  of  the  A,,  and 
variances  and  correlation  reflect  the  “best  guess”  of  the  uncertainty 
in  our  knowledge  of  the  A,-. 

{oi.Cttp}  =  joint  random  variates  for  which  we  assume  sufficient  prior  information 
is  available  to  completely  specify  the  probability  distribution  function 
(pdf).  (Several  reasonable  pdfs  are  specified  below.)  The  prior  dis¬ 
tribution  reflects  our  uncertainty  in  the  knowledge  of  oCl,  otJ  and 
p(tu*i)- 

We  then  define 

^ij  —  "b  fiji  *  =  1, 2;  j  —  1, . . .  ft.  (4) 

We  admit  that  we  are  abusing  the  notation  here,  since  mi}  in  (1)  is  actually  the 
“conditional”  m y  in  (4),  i.e.,  given  at  —  Ax  and  a2  =  A2,  and  we  have  failed  to  make 
that  distinction  in  our  notation.  We  will  also  use  the  same  variables  a,,  of  and  p  to 
denote  the  random  variables  and  the  values  of  the  random  variables.  Nevertheless, 
this  should  not  introduce  any  confusion.  For  example,  the  variables  that  appear  as 
the  arguments  of  the  pdfs,  below  will  always  represent  values  of  the  random  variables. 
Also,  let 

a  =  {ai,a2} 
a  =  {auo2,p} 

—  (mu,  m21, . . .  m1Jc,  m2*) , 

where  we  assume  that  the  associated  yields  are  known  for  k  <  n,  and  unknown  for 
k  =  n  +  1. 


SECTION  3 


TECHNICAL  DEVELOPMENT 


3.1  Bayesian  Approach. 

Consider  now  the  problem  of  determining  the  joint  distribution  /  of  miin+i 
and  given  m„.  This  distribution  will  be  used  to  establish  a  test  of  compliance. 

We  may  write 


/(^l.n+i.mj^+ilm,,)  =  //i(mlin+1,m2in+1>a,a|mn)dad<r 


I  /a  ("»lin+1,m2)n+i|a,ff,mn)  f3  (a,<r|m„)  dado. 


(5) 


where  /2  and  f3  are  functions  to  be  specified.  Using  Bayes’  law,  f3  may  be  expressed 
as 


h  (a,<r|mn) 


/i(a,<r)L(mn|a,<y) 

J  /i(a,«r)L(mn|a,a)dad4r 


(6) 


where  h  is  the  joint  prior  distribution  of  a  and  <r,  and  L  is  the  likelihood  function  for 
the  data,  mn,  given  values  of  a  and  a.  Note  that  (miin+1,mjin+i)  is  dependent  on  mn, 
as  defined  by  (4),  since  the  random  vector  (ai,a2)  is  common  to  both.  However,  for 
fixed  values  of  (a^aj),  (min+i,m2>n+i)  does  not  explicitly  depend  on  mn.  Therefore 


/2  (mi,n+i,Tn2>n^.i|a,<T,fnn)  —  /j  (mi>n+i,m2>n+i|a,<T) . 


(7) 


Assuming  (mu,m2*)  are  bivariate  normal,  then 

/a  (mu,m!*|a,<r)  =  Cexp  (X2k  -  2 pXkYk  +  Y^)  J  =  0(m14,m2t)  (8) 


for  k  =  1, . . . ,  n,  where 
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c 


1 


2*aialv/r^ 


Xt 


n 


mu  -Wk-  a t 

<?i 


mj*  --  W*  -  a2 


ff2 


Using  the  fact  that  for  given  values  of  (a1?  a2),  the  conditional  (mi},  m2j)  and  (mu,  m2*) 
are  independent  for  all  j  ^  Jfc,  it  then  follows  that 


n 

L(mn|a,«r)  =  IJ  0(mu,m2fc).  (9) 

*= i 

Prom  (5),  (6),  (8)  and  (9),  /(mi.n+i,  m2|n+i|mn)  is  completely  determined  if  h( a, a)  is 
available. 


Shumway  and  Der  [1990]  have  derived  this  expression  for 
/(mi,n+i,m2n+1|mn).  In  their  formulation,  the  slopes  are  also  treated  as  unknown. 
Thus,  a  in  our  notation  would  represent  the  matrix  of  intercepts  and  slopes  in  their 
scheme.  We  will  depart  at  this  point  from  their  approach  and  consider  the  impact,  on 
the  corresponding  test  of  hypothesis,  of  the  further  assumptions  of  (2)  and  (3),  where 
H  and  A2  are  given  constants,  determined  from  previous  no-yield  data. 


3.2  The  Constrained  Prior  Distribution. 


We  impose  the  constraints  (2)  and  (3)  by  replacing,  in  (6),  the  prior  h(a,<r) 
with  a  new  prior  he(a,<r).  If  the  mean  and  variance  of  the  difference  are  precisely 
known,  then  we  may  take 


he(a,«r)  =  M  h(a,<r)  6(ai  -  a2  -  /z)  6(<?2  +af  -  2po\o<i  -  A2),  (10) 

where  M  is  a  normalization  constant,  and  tf(z)  is  the  Dirac  delta-function  of  distribu¬ 
tion  theory.  That  is,  the  prior  distribution  we  will  use  in  (6)  is  now  the  constrained 
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distribution  he( a,<r),  which  represents  the  original  assumptions  concerning  the  dis¬ 
tribution  of  a  and  a,  i.e.,  h( a,<r),  and  the  additional  assumption  that  the  mean  and 
variance  of  the  difference  of  the  magnitudes  are  known. 

In  practice,  the  mean  and  variance  of  the  difference  of  the  magnitudes  are 
not  precisely  known,  and  it  may  be  more  appropriate  to  treat  these  quantities  as 
distributed.  We  will  assume,  however,  that  the  no-yield  data  sets  are  sufficiently 
large  such  that  using  the  delta-functions  represents  a  reasonable  approximation.  This 
assumption  allows  us  to  explore  the  maximum  benefit  of  using  no- yield  data. 

The  joint  prior  distribution  h(a,  a)  must  still  be  specified.  There  are  several 
joint  prior  distributions  that  are  worth  investigating.  Shumway  and  Der  [1990]  con¬ 
sider  a  joint  prior  distribution  that  is  the  product  of  a  bivariate  normal  for  a  given  <r, 
and  an  inverted  Wishart  for  a.  We  will  treat  the  a,  as  joint  normal  random  variates 
independent  of  and  p,  as  is  done  by  Nicholson  et  al.  [1991].  That  is,  let 

(:)  -Hz) 

Reasonable  choice  for  the  prior  distribution  of  a  include  the  Wishart  or  inverted 
Wishart,  which  are  multivariate  sampling  distributions,  or  if  the  a,  and  p  are  treated 
as  independent  random  variables,  one  could  use  the  product  of  independent  sampling 
distributions  (see,  for  example,  Anderson  [1958]) 

£[N  -  1)  ~  x\N  -  1)  (12) 

p  ~  Approximately  log  normal.  (13) 

(The  sampling  distribution  for  p  is  rather  cumbersome.  An  expression  for  this  pdf  may 
be  found  on  p.  68  of  Anderson  f 1958]).  The  parameters  for  this  pdf  are  denoted  by  the 
mean,  p0,  and  the  number  of  degress  of  freedom  N-  1,  which  determines  the  variance. 
The  parameters  /*«,.,  oa>,  pa,  a o,.,  po  and  N  may  be  given  by  a  panel  of  experts. 

If  the  parameters  of  the  joint  prior  are  to  be  furnished  by  a  panel,  it  is 
reasonable,  and  technically  less  tedious,  to  consider  the  case  where  the  prior  for  o 
is  also  given  by  a  joint  normal  distribution,  rather  than  the  sampling  distributions. 
This  joint  prior  would  reflect  the  panels  ability  to  provide  estimates  of  the  a  priori 
parameters  and  their  uncertainties.  Although  it  is  certainly  feasible  to  translate  the 
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uncertainties  furnished  by  a  panel  into  an  effective  number  of  degrees  of  freedom,  N, 
this  choice  of  prior  would  alleviate  the  awkwardness  of  doing  so.  If  this  prior  is  used, 
it  would  be  assumed  that  the  means  and  variances  of  the  normal  distributions  for  o, 
and  p  are  such  that  the  probability  that  values  of  these  random  variables  are  outside 
of  their  physically  acceptable  ranges  is  effectively  zero. 


S.S  Summary  of  Technical  Developments. 

It  is  useful  at  this  time  to  review  our  results  to  this  point.  We  have  shown 
that  the  unconstrained  distribution  of  future  magnitudes,  given  previous  calibration 
data  and  prior  information,  may  be  given  by 

/ (fWi.n+1 , m2in+i |mn)  =  J  ft  (mlin+1,m2>n+1ja,<r)  /,  (a,*|mn)  dado,  (14) 


where  /j  is  given  in  (8), 


h  (a,<r|mn) 


h(a,o)L(mn\a,o) 

J  h( a,  o)L(mn |a,  o)dado 


(15) 


L(mn\a,o)  is  given  in  (9),  and  h(a,o)  is  specified  by  (ll)-(13). 

Analogous  expressions  may  also  be  given  for  the  constrained  distributions, 
which  we  denote  with  the  subscript  “c” .  Thus,  the  constrained  distributions  corre¬ 
sponding  to  /  and  fs  are  denoted  by  fe  and  /se,  and  the  constrained  prior  distibution 
is  given  by 


he(a,o)  =  U  h(a,o)  6(ai  -  o2  -  p)  6(o\  +ol~  2polo2  -  A*),  (16) 

where  the  normalization  constant  M  is  needed  to  ensure  that  he(a,  o)  is  a  density 
normalized  to  unity. 

We  are  almost  in  a  position  to  develop  a  test  of  compliance.  First,  it  is 
useful  to  define  a  weighted  combination  of  the  magnitudes  by 


^r,n+l  —  "t"  (1  r)^2,n+lt 


(17) 
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where  0  <  r  <  1.  The  marginal  distribution  of  mri„+i,  given  a  and  o,  is  completely 
determined  by  the  joint  distribution  /j  of  mln+1  and  m2in+i,  given  a  and  o.  If  we 
define  this  marginal  distribution  by  <7j(mr  n+1|a,<r),  it  is  straightforward  to  show,  using 
(8),  that 


0j(mr>n+1|a,<r)  = 


\/25r<7r 


exp 


1  ( mr,n+i  ~  rai  -  (1  -  r)a2  -  ' 

2  V  J 


(18) 


where  <r*  =  r2o\  +  (l  —  r)2a\  +  2r(l  —  r)pOi02.  If  we  also  define  gr(mri„+i|mn)  to  be 
the  marginal  distribution  corresponding  to  the  joint  distribution  /(miin+x,m2,n+i|>,nn), 
then 


</(mri„+i|mn)  =  J  dado  ffj(mfi„+i|a,  <r)/3(a,<r|mn).  (19) 

There  is  zui  analogous  expression,  ge,  for  the  constrained  marginal  distribution.  These 
are  the  distributions  that  will  be  used  to  test  compliance. 
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SECTION  4 


YIELD  ESTIMATION  AND  TESTING  COMPLIANCE 


For  Bi  —  Bi  =  1,  the  log  yield  may  be  estimated  in  terms  of  the  new 
weighted  magnitude  mr  n+1  by 

Wn+l  =  rrtr^i  -  rAi  -  (1  -  r)Aj  (20) 

A  A  A  A 

where  AitA2  are  estimates  of  the  intercepts.  For  data  only,  Ax,A2  may  be  taken  to 
be  the  usual  point  estimates.  Using  prior  information  only,  we  can  take  Ax  =  pa, 
and  A]  =  pOJ.  When  data  and  prior  information  are  available,  the  posterior  means  of 
a\,a2  are  the  most  natural  estimates  of  the  intercepts.  The  posterior  means  may  be 
computed  as  expectation  values  of  a1?aj  using  /sf(a,<y|mn). 

We  will  establish  a  test  of  hypothesis  based  on  the  magnitude  mr  n+1,  and 
compute  the  power  and  F-number  of  the  test.  The  power  of  the  test  is  the  proba¬ 
bility  of  rejecting  the  hypothesis  that  the  true  yield  is  in  compliance  with  the  treaty 
threshold.  Note  that  basing  the  test  on  the  magnitude  is  equivalent  to  basing  the 
test  on  the  AFTAC  estimator  of  the  log  yield,  which  is  given  by  (20),  using  constant 
panel  estimates  /*„,  and  pa,  for  the  unknown  intercepts.  The  critical  value  of  the  test 
(defined  below)  in  terms  of  this  estimator  is  shifted  by  the  constant  r/i0l  +  (1  -  r)paj 
relative  to  the  critical  value  of  the  test  in  terms  of  the  magnitude.  The  test  is  of  course 
unchanged  by  this  constant  shift. 

Alternatively,  an  estimate  of  the  log  yield,  given  in  terms  of  the  new  mag¬ 
nitude  and  the  posterior  means  of  ai  and  a2)  could  be  used  in  principle  to  establish  a 
test  of  hypothesis.  It  is  not  clear  whether  this  test  is  significantly  different  or  better 
than  the  test  in  terms  of  the  magnitude.  This  test  is  complicated,  however,  by  the 
fact  that  the  distribution  of  Wn+l,  in  this  case,  is  unknown.  Shumway  and  Der  [1990} 
encounter  this  obstacle  in  their  analysis  as  well.  Assuming  unknown  slopes  in  their 
work  further  complicates  this  matter.  Rather  than  establishing  a  test  of  hypothesis 
in  terms  of  the  estimate  of  the  log  yield,  they  chose  to  compute  confidence  intervals 
for  the  new  yield  considered  as  a  true  fixed  future  observation. 

Consider  now  a  100a%  significance  level  test  of  the  null  hypothesis,  Ho  : 
W  <  Wt ,  versus  the  alternative,  H\:W  >  WT,  of  the  form: 
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(21) 


Reject  H0  if  mr>n+1  >  Ta, 

where  WT  is  the  treaty  threshold  of  the  log  yield,  and  Ta  satisfies 


P  [mrin+1  >  Ta\mn,  Wn+1  =  WT )  =  a.  (22) 

That  is,  assuming  the  true  yield  is  at  the  treaty  threshold,  the  critical  value,  Ta,  is 
determined  such  that  the  hypothesis  Ho  is  falsely  rejected  only  100a%  of  the  time. 
Using  (19),  (22)  is  equivalent  to 

roo 

/  dWr.n+i  g{mr,n+x\rhn,Wn+i  =  WT)  =  a.  (23) 

JTn 

Once  Ta  is  determined,  the  power  of  the  test  is  defined  to  be  the  probability  that  the 
null  hypothesis  is  rejected,  i.e.,  for  fixed  Ta,  the  power  at  Wn+i  =  W0  is  given  by 


Power (W0)  =  P  [mr,n+1  >  Ta  |mn,  Wn+l  =  WQ\ .  (24) 

The  F-number  of  the  test  is  related  to  the  power.  Using  the  definition  given 
by  Alewine  et  al.  [1988],  the  F-number  is  given  by 

F  =  10Wf~Wt,  (25) 

where  Wp  is  the  value  of  the  true  log  yield  at  which  the  power  is  0.5.  That  is,  Wp 
satisfies  the  equation 


P  [mr,n+1  >  Ta|mn,  Wn+l  =  WF]  =  0.5.  (26) 

It  is  important  to  understand  that  the  power  functions,  computed  in  this 
way,  represent  the  probabilities  that  the  null  hypothesis  (i.e.,  the  hypothesis  that  the 
yield  of  the  event  was  in  compliance  with  the  treaty  threshold)  is  rejected,  only  if  the 
underlying  assumptions  are  valid.  That  is,  the  expression  in  (24)  is  the  true  probability 
of  calling  a  violation  only  if  the  weighted  magnitude  of  the  new  event  is  distributed  as 
in  (19)  for  the  unconstrained  case,  or  analogously  for  the  constrained  case.  Thus,  for 
example,  it  is  assumed  that  the  intercepts  are  distributed  as  in  (15).  This  expression 
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for  the  unconstrained  case,  and  the  analogous  one  for  the  constrained  case,  depend 
on  the  a  priori  assumptions.  For  practical  monitoring  applications  these  assumptions 
may  not  hold,  however,  they  reflect  our  best  estimation  of  the  unknown  calibration 
parameters  based  on  a  priori  information,  calibration  data  and  no-yield  data  (in  the 
constrained  approach).  As  a  result,  the  actual  false  alarm  rate  for  monitoring  a 
particular  site  may  differ  from  the  target  value  of  a.  We  investigate  this  issue  in  more 
detail  in  Section  6. 
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SECTION  5 


POWER  COMPARISON 


In  order  to  assess  the  impact  of  imposing  the  prior  information,  particularly 
the  constraints,  it  is  useful  to  compare  the  power  of  the  test  for  the  constrained 
and  unconstrained  distributions.  We  will  consider  a  special  case  of  the  formulation 
above,  by  treating  the  covariance  matrix  associated  with  the  random  errors  as  known, 
without  uncertainty,  so  that  analytic  expressions  may  be  obtained  for  the  power  and 
F-number.  Nicholson  et  al.  (199l|  also  make  this  assumption  in  their  analysis.  (A 
numerical  simulation  is  needed  to  compute  the  power  for  the  general  case.  Details 
of  the  simulation  will  be  contained  in  a  future  report.)  Consider  the  following  four 
100a%  significance  level  tests: 


test  1 


test  2 

test  3 
test  4 


a  test  of  hypothesis  based  on  the  assumption  that  the  calibration 
parameters  (intercepts,  slopes  and  covariance  matrix)  are  completely 
known.  In  practice,  this  test  is  infeasible,  however,  it  serves  as  a 
reference  of  the  maximum  possible  power  for  fixed  false  alarm  rate, 
a  test  of  hypothesis  based  on  the  constrained  Bayesian  approach, 
treating  only  the  intercepts  as  unknown.  (The  only  constraint  used  for 
this  study  is  the  one  on  the  mean  of  the  difference  of  the  magnitudes.) 
a  test  of  hypothesis  based  on  the  unconstrained  Bayesian  approach, 
treating  only  the  intercepts  as  unknown. 

a  test  of  hypothesis  based  on  calibration  data  only,  treating  only  the 
intercepts  as  unknown. 


Tests  1-4,  of  the  form  (21),  may  be  established  by  determining  the  distribu¬ 
tions  of  mr  n+i  under  the  appropriate  assumptions.  The  expressions  for  the  power,  as 
computed  in  the  previous  section,  and  in  more  detail  below,  are  the  probabilities  of 
detecting  a  violation  for  fixed  false  alarm  rate,  when  the  assumptions  corresponding 
to  each  of  the  tests  are  valid.  Thus,  they  represent  the  optimum  power  of  each  test. 


For  simplicity,  and  since  the  covariance  matrix  is  assumed  to  be  known,  let 
P  —  /»(*!>  c*)>  =  i  an<*  Oj  —  To  establish  test  1,  note  that 


AT 

Or 


AT(0,1), 


(27) 
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where  Ar  =  rA\  +  (1  -  r)  A2  and  o*  =  r*a\  +  (1  —  r)*o\ + 2r(l  —  r)pOxtJ2  •  (Cf.  Alewine  et 
al.  [1988]  and  Gray  and  Woodward  [1990]).  To  establish  test  4  (previously  considered 
by  Hafemeister  [1987]),  we  first  define  the  usual  point  estimate  of  Ar  by 

Ar  =  -  (rmi*  +  (!  “  r)m«  “  •  (28) 

n  t=i 


It  may  then  be  shown  that 


~  tf(0,l). 

fi+iV/J 


nj 


Or 


(29) 


The  distributions  of  mr>n+i|m„  for  tests  2  and  3  may  be  computed  using 
(8)-(ll),  (15)  and  (18)  in  (19),  where  £  is  now  treated  as  a  fixed  matrix  of  known 
constants.  Making  these  insertions,  it  is  straightforward  to  show  that  for  the  con¬ 
strained  case 


^r.n+l  Ac 

\Jo3r  +  °\ 


N(  0,1), 


(30) 


while  for  the  unconstrained  case 


1  Wn+ 1  Au 


N(  0,1), 


(31) 


where 


«?  =  [u'(e;i  +  „E'1)u]"1 

(32) 

,  r'i(E;,+nE-,)rJ. 

"  det(E~‘  +  nE'1) 

(33) 

,  -r.„  +  "(X-OJ'E-U 
e  u'(£  J1 +  «£-')« 

(34) 
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(35) 


A .  =  r>„  +  r’  (E;1  +  n£~‘) nE'1  (X  -  0.)  . 

In  these  expression,  we  have  defined  the  vectors  u'  =  (1,1),  r*  =  (r,  1  —  r),  rj.*  = 
(1  —  r,  —  r),  X  =  (mj  -  W,fn2  —  W),  ft'a  =  (ManMaa)>  where  the  prime  is  used  to 
denote  the  transpose  of  a  column  vector.  We  have  also  defined 


E  = 


cl  p0\02 
P°\°l  o\ 


(36) 


S„  = 


'a  i 


Pa^ai^a? 


(37) 


and  £  1  and  £a  1  denote  their  matrix  inverses. 

Using  the  distributions  of  (27)  and  (29)-(31),  the  critical  values  of  the  four 
tests  may  be  obtained  as  in  (23)  such  that  the  area  under  the  distributions,  to  the 
right  of  the  critical  value,  is  a.  The  resulting  critical  values  of  the  tests  Eire 


Tic  =  ZaoT  +  Wr  +  Ar 
T2a  =  Za\jo?  +  o]  +  WT  +  Ac 
Tsa  =  Za\Jo r  *f  o\  +  Wt  +  Au 
T^a  =  Zaor\J  1  +  1/n  +  Wt  +  Ar, 


(38) 


where  Za  is  the  100(1  -  a)  percentile  of  iV(0, 1). 

The  power  functions  of  the  tests,  at  log  yield  W0,  when  the  appropriate 
assumptions  are  correct,  are  given  by 
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POWER(Wo)i  =  erf z(Za  + (WT -WQ)far ) 


POWER(W0)2  = 


POWER(W0)S  = 


POWER(W0)4  = 


erfc  [Za  +  ( WT  -  W0)/y/o*+o *) 
erfc  (za  +  (WT  -  W0)/y/a*+ol ) 
erfc  (za  +  (WT  -  W0)/aryJl  +  1/n)  , 


where  erfc(x)  is  the  complementary  error  function  given  by 


a(<x)  =  7S  C 


It  may  also  be  shown  that  the  F-numbers  of  the  tests  are  given  by 


=  10*“"' 


Fs  =  ioz“V^+^ 

Fa  =  10  Z-Oryfi+ijn 

Note  that  although  the  critical  values  for  tests  2,  3  and  4  depend  on  the 
calibration  data,  the  power  functions  and  F-numbers  depend  only  on  the  sample  size 
n,  and  not  on  the  particular  sample.  Also,  it  is  straightforward  to  show  that  if  either 
n  — »  oo  or  o0,,oaa  — ►  0,  then  <re,  ou  —»  0. 

Since  these  formulas  are  somewhat  unwieldy,  we  have  computed  the  power 
curves  and  F-numbers  of  the  four  tests  for  the  cases  listed  in  Table  1.  We  have  set 
a  =  0.025  and  W?  =  log  150.  Figures  1-8  show  the  resulting  power  of  the  four  tests 
as  functions  of  the  true  yield,  denoted  by  solid,  dashed,  dotted  and  dashed-dotted 
curves,  respectively.  The  four  frames  in  each  figure  show  the  results  for  n  =  0, 1,2,3. 
The  power  curves  of  test  4  may  be  computed  only  for  non-zero  values  of  n.  The 
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Table  1.  List  of  parametric  cases  for  the  power  comparison. 


Case 

0\ 

<7j 

P 

Pa 

1 

mm 

1 

mm 

2 

: 

■SI 

H 

3 

0.05 

0.05 

0.5 

0.05 

0.10 

Mm 

4 

0.05 

0.05 

0.5 

0.05 

0.10 

0.5 

5 

0.05 

0.10 

0.5 

0.05 

0.05 

0.0 

6 

0.05 

0.10 

0.5 

0.05 

0.05 

0.5 

7 

0.05 

0.10 

0.5 

0.05 

0.10 

0.0 

8 

0.05 

0.10 

0.5 

0.05 

0.10 

0.5 

corresponding  F-numbers  are  given  in  the  legends.  Note  that  in  Figures  1,  2,  5  and 
6,  the  power  curves  of  tests  2  and  3  are  identical. 

There  are  several  significant  points  to  make  concerning  these  results: 

1.  Test  1  is  clearly  the  test  with  the  greatest  power  in  all  cases.  This  is  expected 
since  there  is  no  uncertainty  in  the  calibration  parameters. 

2.  The  power  of  test  2  is  greater  than  or  equal  to  the  power  of  test  3  and,  hence, 
the  F-numbers  of  test  2  are  less  than  or  equal  to  those  of  test  3  in  all  cases. 

3.  Equality  occurs  for  cases  in  which  the  standard  deviations  of  the  random  errors 
and  int  cepts  are  the  same  for  both  magnitudes  (e.g.,  Figures  1,  2  for  all  values 
of  n  and  Figures  5,  6  for  n  =  0.)  Note  that  for  n  =  0,  the  random  errors 
contribute  identically  to  the  power  of  tests  2  and  3  via  a *.  For  n  >  0,  Figures  5 
and  6  show  that  the  power  of  test  2  is  slightly  greater  than  that  of  test  3  even 
though  the  uncertainty  in  the  intercepts  is  the  same  for  both  magnitudes. 

4.  The  greatest  increase  in  power  of  test  2  relative  to  test  3  occurs  when  the  un¬ 

certainty  in  one  of  the  intercepts  is  much  larger  than  the  other  (e.g.,  Figures  3, 
4,  7,  8).  Note  that  for  n  >  0  the  power  of  test  3  rapidly  converges  to  that  of 
test  2  for  the  cases  shown  in  Figures  3  and  4.  Figures  7  and  8  show  that  the 
convergence  is  slower  for  cases  in  which  a2  >  . 

5.  In  all  cases,  the  power  curves  of  tests  2,  3  and  4  converge  to  the  power  curve  of 
test  1  as  n  increases. 
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6.  In  all  cases,  test  4  (based  only  on  calibration  data)  has  the  least  power  of  the 
four  tests. 

These  results  show  that  the  optimum  power  of  the  constrained  test  is  always 
at  least  as  great  as  that  of  the  unconstrained  test.  The  constraint,  based  on  the  no¬ 
yield  data,  provides  the  greatest  benefit  for  cases  in  which  there  is  greater  uncertainty 
in  one  of  the  intercepts.  It  is  not  surprising  that  for  equivalent  a  priori  assumptions 
regarding  the  uncertainty  in  the  intercepts,  the  constrained  and  unconstrained  tests 
provided  similar  results.  The  power  computed  for  test  2  assumes  that  a  constraint 
physically  exists  and  makes  use  of  that  information  to  compute  the  critical  value  of 
the  test.  Alternatively,  the  power  computed  for  test  3  assumes  that  the  difference  in 
the  intercepts  is  not  necessarily  a  fixed  constant  and,  hence,  ignores  this  information 
when  computing  the  critical  value  of  the  test.  Thus,  the  power  functions  of  tests  2 
and  3  represent  the  optimum  power  of  each  test,  but  they  are  being  compared  for  two 
physically  different  situations. 

A  useful  comparison  of  tests  2  and  3  would  be  to  assume  that  the  magni¬ 
tudes,  are  distributed  as  assumed  in  the  constrained  case,  but  only  the  constrained 
approach  makes  use  of  that  information  to  set  the  critical  value.  This  comparison 
is  more  realistic  since,  in  practice,  the  difference  in  the  intercepts  and  hence  the  dif¬ 
ference  in  the  expected  magnitudes  are  given  by  a  fixed  constant.  The  constrained 
approach  makes  use  of  the  no-yield  data  to  determine  what  this  fixed  constant  is, 
while  the  uncontrained  approach  does  not. 

Rather  than  perform  this  comparison,  for  which  the  Bayesian  assumptions 
would  still  differ  from  the  assumptions  of  test  1,  we  will  examine  the  case  in  which 
the  magnitudes  are  distributed  as  assumed  in  tests  1  and  4,  i.e.,  as  in  (1),  but  the 
critical  values  of  eacty  test  are  set  as  above.  The  actual  probabilities  of  calling  a 
violation  based  on  each  of  the  four  tests  may  then  be  compared  under  the  same  test 
site  conditions.  This  comparison  is  contained  in  the  following  section. 
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SECTION  6 


ROBUSTNESS 


An  important  issue  regarding  the  monitoring  of  a  particular  test  site  is  the 
robustness  of  the  test  of  compliance  to  the  assumptions  that  are  made  in  the  statistical 
methodology.  In  particular,  the  test  of  compliance,  based  on  a  Bayesian  approach, 
depends  on  the  assumed  parameters  of  the  prior  distribution.  It  is  important  to 
note  that  the  expression  for  the  power  in  (24),  and  the  power  of  the  Bayesian  tests 
computed  in  the  previous  section  are  the  true  probabilities  of  calling  violations  only 
if  the  magnitudes  of  the  new  events  are  distributed  as  in  (19)  for  the  unconstrained 
case,  or  the  analogous  expression  for  the  constrained  case.  The  expressions,  however, 
are  only  estimates  of  the  true  distribution  which  depends  on  the  true,  but  unknown, 
intercept,  slope  and  covariance  matrix  parameters. 

An  alternative  interpretation  of  the  power  of  the  Bayesian  tests  is  the  fol¬ 
lowing.  Consider  the  case  where  n  =  0.  Then  the  power  computed  above  is  the  true 
probability  of  calling  a  violation  only  if  the  assumed  prior  is  correct,  i.e.,  only  if  the 
intercepts  are  actually  distributed  as  assumed  in  the  constrained  or  unconstrained 
priors.  In  this  sense,  the  joint  priors  may  be  interpreted  as  distributions  of  all  possi¬ 
ble  test  sites.  For  a  particular  test  site,  however,  the  intercepts  are  not  distributed. 
Rather,  they  have  fixed,  although  unknown  values.  Thus,  the  false  alarm  rate  using  a 
Bayesian  approach,  for  a  sequence  of  events  at  a  particular  test  site,  is  not  necessarily 
a.  We  refer  to  the  false  alarm  rate,  power  and  F-number  in  this  case  as  the  “actual” 
quantities,  “conditional”  on  the  values  of  the  intercepts  for  the  given  test  site. 

Here  we  assess  the  robustness  of  the  actual  power  of  the  tests  based  on  the 
Bayesian  approaches.  That  is,  suppose  we  set  the  critical  value  Ta  as  before.  We  now 
want  to  determine  the  actual  power  of  the  test,  conditional  on  the  true  values  of  the 
intercepts,  A'  =  (Aj.Aj),  when  the  means  of  the  Bayesian  prior  are  not  necessarily 
equal  to  them,  e.g.,  we  will  consider  cases  in  which 


M<».  =  A,  ±  coai,  c  =  0.  ±1 ,  +2.  (42) 

The  conditional  power  is  the  probability  that  mr  is  greater  than  the  critical 
value,  weighted  over  the  distribution  of  all  possible  critical  values.  Thus, 
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POWER(Wo|A)  =  f  dTa  p[Ta\mn,W  =  W0]  P[mr  >  Ta;A,E|W  =  W0\,  (43) 


where 


P[mr  >  Ta;  AtH\W  =  W0 ]  =  -jL-  f°°  dmr  exp  - \  ( 

\2‘kot  JTa  2  \ 


1  ( ^r,n+l  Ar  VVn 


It  may  be  shown  that  the  actual  power  functions  of  tests  1  and  4  are  the  same  as 
those  computed  in  (39).  The  distributions  of  the  critical  values  for  tests  2  and  3  may 
be  determined  from  (32)-(35)  and  (38).  Also,  since  Ae  and  Au  depend  linearly  on  the 
randomly  distributed  mean  magnitudes  mj  and  m2,  it  may  be  shown  that,  for  tests  2 
and  3,  Ta  is  distributed  as 


Ta  -  Mr 


*(0,1), 


where  for  the  constraineo  ~ase 


«■-  -  ™  +  «”»  +  *  + 


=  Var\Tia\  = 


nu'E-'a 

[u'(E~l  +  n£_1)u]! 


while  for  the  unconstrained  case 


MT3„  =  E^}  =  Zed  a*  +  o*  +  WT  +  Ar  +  r'(Ea-‘  +  nSTT^V  -  A)  (48) 


43o  =  Var\T3a\  =  nr'(E;‘  +  nST1)-1^'^-1  +  nE"1)"^  (49) 

In  obtaining  these  expressions,  it  is  assumed  that  the  magnitudes  are  distributed  as  in 
(1),  and  r'A  =  Ar.  No«.e  that  in  the  constrained  case,  A2  =  Ax-  p  and  paj  =  pai  -  m. 
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hence,  (fia  —  A)'  =  (/x0l  -  Ai)u'  =  (/i„3  -  Aj)u'.  This  relationship  does  not  hold  for 
the  unconstrained  case. 

Using  (43)-(49),  the  actual  power  for  tests  2  and  3,  conditional  on  A,  may 
be  expressed  as 


POWER(W'olA)  =  erfc  (fo) , 


(50) 


where 


Mr  ~  Ar  —  WQ 

+  <4 


(SI) 


It  is  straightforward  to  show  t:>at  as  n  — ►  oo,  MrJa>Mrs„  — *  Zaor  +  Wt  +  Ar  and 
°TJa  *  °T3a  0.  Thus,  in  the  limit  as  n-*  oo,  the  power  curves  of  tests  2  and  3 
converge  to  that  of  +<at  1,  and  the  false  alarm  rates  of  tests  2  and  3  approach  a.  It 
may  also  be  shown  that  the  same  result  occurs  if  /ia  =  A  and  oai ,  o0j  — >  0. 

To  illustrate  the  results,  we  have  plotted  the  actual  power  curves  of  the  four 
tests  as  functions  of  the  yield  for  the  cases  listed  in  Table  2.  For  each  of  these  cases  we 
have  considered  five  combinations  of  the  prior  means  of  the  intercepts  listed  in  table 
3. 


Table  2.  List  of  parametric  cases  for  the  conditional  power  comparison. 


Case 

°\ 

07 

P 

oai 

Pa 

1 

jmm 

1 

1 

0 

2 

;  / 

0.5 

1  i 

3 

0.05 

0.05 

0.5 

0.05 

0.15 

0 

Figures  9(a-e)-ll(a-e)  show  the  results  for  these  cases.  As  before,  the  solid,  dashed 
and  dotted  curves  represent  the  power  functions  of  tests  1,  2  and  3,  respectively.  The 
four  frames  in  each  figure  show  the  results  for  n  =  0, 1,2,3.  The  actual  false  alarm 
rates  achieved  by  each  of  the  tests  are  shown  in  the  figure  legends.  The  power  curves 
of  test  1  reprc»«nt  the  maximum  probability  of  detecting  a  violation,  at  a  particular 
yield,  for  a  fixed  false  alarm  rate.  This  test  is  infeasible,  however,  the  performance  of 
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Table  3. 


List  of  prior  means  used  for  each  of  the  cases  listed  in  Table  2. 


Case 

(m«h  Ax)/oai 

(paa  A2)/<raa 

a 

0 

0 

b 

0 

2 

c 

0 

-2 

d 

2 

2 

e 

-2 

-2 

the  other  two  tests  should  be  measured  by  the  similarity  of  the  their  power  curves  to 
those  of  test  1. 

The  significant  results  of  this  study  are  the  following: 

1.  For  all  cases  in  which  neither  of  the  means  of  the  prior  were  chosen  to  be  two 
standard  deviations  less  than  the  true  intercepts,  the  power  of  test  2  is  greater 
than  or  equal  to  that  of  test  3,  i.e.,  except  for  those  cases  labelled  by  “c”  and 
V\ 

2.  The  cases  for  which  the  power  of  test  2  is  dramatically  better  than  that  of  test 
3  are  those  in  which  paa  =  A2  +  2 oaj,  particularly  if  pa,  =  Ax,  but  even  if 
**a,  =  Ax  +  2 oai.  See,  for  example,  Figures  9(b,d)-ll(b,d).  Also,  cases  in  which 
oa2  is  significantly  greater  than  ffa,  lead  to  greater  relative  power  of  test  2  to 
test  3.  Compare  Figure  9,  10  and  11. 

3.  The  power  of  tests  2  and  3  are  equal  if  the  <7a,  =  a„3  and  ox  =  o2  (provided 
n  =  0).  See,  for  example,  Figures  9a  for  all  values  of  n  and  12a  for  n  =  0. 

4.  For  those  cases  labelled  by  “c”  and  “e” ,  in  which  one  or  both  of  the  means  of  the 
prior  were  chosen  to  be  two  standard  deviations  less  than  the  true  intercepts, 
the  power  of  test  3  is  greater  than  that  of  test  2,  but  the  false  alarm  rates  of 
test  3  are  also  noticeably  greater  than  0.025.  The  false  alarm  rates  of  test  2  are 
greater  than  0.025  only  for  the  cases  labelled  by  “e”,  but  are  still  considerably 
smaller  than  the  false  alarm  rates  of  test  3. 

5.  For  the  cases  examined,  the  largest  actual  false  alarm  rate  for  test  2  was  0.084 
(Figure  9e),  while  for  test  3  the  largest  actual  false  alarm  rate  was  0.705.  After 
as  few  as  two  calibration  events  the  false  alarm  rate  of  test  2  was  no  greater 
than  0.055  for  all  of  these  cases. 
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6.  The  power  of  test  2  (constrained  Bayesian)  was  less  than  that  of  the  test  4  (cal¬ 
ibration  data  only),  except  for  cases  in  which  both  prior  means  of  the  intercepts 
were  two  standard  deviations  greater  than  the  true  intercepts. 

These  results  show  that  the  constrained  Bayesian  approach  is  far  more  ro¬ 
bust  than  the  unconstrained  approach.  That  is,  the  power  curves  of  test  2  (constrained 
Bayesian  approach)  are  far  less  sensitive  to  uncertainties  in  the  intercepts  and  incor¬ 
rect  assumptions  regarding  the  prior  means  of  the  intercepts.  It  was  much  more  likely 
that  test  3  (unconstrained  Bayesian  approach)  had  dramatically  less  power  or,  al¬ 
ternatively,  a  false  alarm  rate  that  was  dramatically  too  large.  For  all  of  the  cases 
considered  here,  the  power  of  test  2  was  always  closer  to  the  target  power  of  test  1 
than  the  power  of  test  3. 

Comparisons  to  test  4  show  that  the  power  of  test  2  is  greater  in  all  cases, 
except  those  for  which  both  prior  means  of  the  intercepts  are  considerably  larger 
than  the  true  values.  The  priors,  however,  are  presummably  assigned  with  sufficient 
confidence  so  that  the  probability  of  such  an  occurance  is  extremely  small  (<  2%).  In 
contrast,  if  either  or  both  of  the  prior  means  were  two  standard  greater  than  the  true 
intercepts,  the  power  of  test  3  was  less  than  that  of  test  4. 

The  study  just  described  was  repeated  for  ot  =  0.10,  with  all  of  the  other 
parameters  in  Tables  2  and  3  the  same  as  before.  The  results  are  very  similar  to  those 
presented  in  Figures  9-11.  The  relative  improved  performance  of  test  2  to  test  3  was 
slightly  more  pronounced  for  these  cases  as  it  was  for  the  power  comparison  in  Section 
5. 
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SECTION  7 


CONCLUSIONS  AND  FUTURE  WORK 


In  this  report,  we  have  developed  a  constrained  Bayesian  approach  for  es¬ 
timating  the  yield  for  a  future  underground  test  based  on  expert  prior  information, 
calibration  data  (if  available)  and  no-yield  data.  The  approach  was  formulated  for  the 
case  in  which  the  intercepts  and  the  covariance  matrix  of  the  random  seismic  errors 
are  unknown. 

The  results  of  the  power  comparison  in  Section  5,  treating  only  the  intercepts 
as  unknown,  showed  that  the  test  of  hypothesis  based  on  the  contrained  Bayesian 
approach  is  always  as  good  or  better  than  the  Bayesian  test  without  the  constraint, 
and  is  always  better  than  the  test  based  only  on  calibration  data  when  the  appropriate 
Bayesian  assumptions  hold.  The  constrained  approach  is  particularly  useful  for  cases 
in  which  the  uncertainty  in  one  of  the  intercepts  is  relatively  large,  and  little  or  no 
calibration  data  is  available. 

The  robustness  study  in  Section  6  showed  that  the  constraint,  based  on  the 
no-yield  data,  greatly  corrects  for  poor  prior  information  regarding  either  or  both 
intercepts.  If  the  prior  means  of  the  intercepts  are  chosen  too  small,  the  false  alarm 
rate  of  the  constrained  approach  may  exceed  0.025,  but  will  be  considerably  closer 
to  the  desired  value  than  the  false  alarm  rate  of  the  unconstrained  approach.  The 
actual  power  of  the  unconstrained  test  was  greater  in  these  cases,  but  this  test  is  not 
favored  because  of  the  high  false  alarm  rate.  In  all  other  cases,  the  actual  power  of 
the  constrained  test  was  at  least  as  great  as  that  of  the  unconstrained  test. 

If  there  is  no  calibration  data,  the  constrained  approach  is  the  best  of  the 
methods  explored  here  for  testing  TTBT  compliance.  If  the  experts  are  confident 
that  both  prior  means  of  the  intercepts  have  not  been  dramatically  underestimated 
or  overestimated,  the  power  of  the  constrained  approach,  for  significance  level  less 
than  or  equal  to  0.025,  is  greater  than  the  approach  based  only  on  calibration  data, 
requiring  roughly  half  as  many  calibration  events  to  achieve  the  same  F-number  as 
the  latter  test. 

In  the  future,  a  simulation  should  be  performed  to  compare  the  power  func¬ 
tions  of  tests  similar  to  tests  2,  3  and  4  above,  but  also  treating  the  covariance  matrix 
of  the  random  seismic  errors  as  unknown,  which  indeed  it  is.  This  simulation  is  near 
completion,  but  we  were  unable  to  obtain  the  results  in  time  for  this  report.  The 
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importance  of  treating  the  covariance  matrix  as  unknown  is  illustrated  in  Figures  12 
and  13,  which  show  the  power  curves  of  three  tests  for  two  different  values  of  a%. 
Tests  1  and  2  correspond  to  tests  1  and  4  above,  i.e.,  treating  all  of  the  parameters  as 
known  for  the  first,  and  only  the  intercepts  as  known  for  the  second.  For  test  3,  the 
intercepts  and  the  covariance  matrix  are  treated  as  unknown,  using  only  calibration 
data  to  estimate  them.  This  test  may  be  based  on  Student’s  f-distributions  as  shown 
by  Alewine  et  ai.  [1988]  and  Gray  and  Woodward  [1990]. 

These  figures  show  that  the  treatment  of  the  covariance  matrix  is  very  im¬ 
portant.  In  many  of  the  previous  figures  the  power  curves  of  all  four  tests  were  not 
dramatically  different.  However,  for  the  realistic  case  in  which  the  covariance  matrix 
is  treated  as  unknown,  the  differences  in  the  tests  are  expected  to  be  much  more 
significant.  For  example,  a  test  based  on  calibration  data  alone,  and  the  assumption 
of  unknown  intercepts  and  covariance  matrix,  cannot  be  used  unless  data  for  at  least 
two  calibration  events  are  available. 

There  is  still  considerable  work  that  can  be  done  to  improve  our  approach. 
In  the  future,  we  will  extend  our  analysis  to  treat  the  case  of  unknown  slopes.  This 
extension  is  important  if  this  approach  is  to  have  monitoring  applications  relevant  to  a 
low-yield  or  comprehensive  test  ban  treaty.  We  will  also  explore  the  effect  of  relaxing 
the  assumption  that  n  and  AJ  are  exactly  known.  This  is  a  necessary  modification  in 
order  to  apply  the  constrained  Bayesian  approach  to  monitoring  test  sites  for  which 
there  is  limited  no-yield  data. 
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Figure  1.  Power  curves  of  the  four  tests  as  functions  of  yield  for  the  case  in  which 
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functions  of  yield  for  the  case  in  which 
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Figure  5.  Power  curves  of  the  four  tests  as  functions  of  yield  for  the  case  in  which 
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Figure  8.  Power  curves  of  the  four  tests  as  functions  of  yield  for  the  case  in  which 
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Figure  9a.  Actual  power  curves  of  the  four  tests,  as  functions  of  yield,  for 
the  case  in  which  at  =  .05,  o*  =  .05,  p  =  .5,  cmt  =  .05,  au  =  .05, 
p.  =  .0,  and  (pi*  -  At)/am1  =  0,  (/a*  -  Aj/c*  =  0. 
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Figure  9e.  Actual  power  curves  of  the  four  tests,  as  functions  of  yield,  for 
the  case  in  which  =  05,  ff«  =  .05,  p  -  .5,  =  .05,  o«*  =  .0 

p.  =  .0,  and  (p*  -  A,)/am,  =  -2,  (/a*  -  Aj/au  -  -2. 


Figure  10b.  Actual  power  curves  of  the  four  tests,  as  functions  of  yield,  for 
the  case  in  which  ax  =  .05,  a,  =  .05,  p  -  .5,  am1  =  .05,  =  .10, 

p.  =  .0,  and  0 ~  Aj/a*  -  0,  (p*  -  Aj/ou  =  2. 
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Figure  lOd.  Actual  power  curves  of  the  four  tests,  as  functions  of  yield,  for 
the  case  in  which  a,  =  .05,  crt  =  .05,  p  =  .5,  am1  =  .05,  a *  =  .10, 
p.  =  .0,  and  04*  -  4JA*  =  2,  04*  -  ^JA-  =  2. 
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Actual  power  curves  of  the  four  tests,  as  functions  of  yield,  for 


Figure  11a.  Actual  power  curves  of  the  four  tests,  as  functions  of  yield,  for 
the  case  in  which  ot  =  So,  a t  —  .05,  p  —  .5,  <rm1  =  .05,  o—  =  .15, 
p.  =  .0,  and  (p*  -  A  J/ct.,  =  0,  (p»*  -  A,)/au  =  0. 


Actual  power  curves  of  the  four  tests,  as  functions  of  yield,  for 
the  case  in  which  ax  =  .05,  *  .05,  p  *  .5,  am1  -  .05,  o*  =  .1 

pm  -  .0,  and  -  A,)/a„  =  0.  (mu  -  Aj/au  =  2. 


Figure  11c.  Actual  power  curves  of  the  four  tests,  as  functions  of  yield,  for 
the  case  in  which  a,  =  .05,  o*  =  .05,  p  =  .5,  a„  =  .05,  =  .15, 

p.  =  .0,  and  -  A,)/i 7„  =  0,  (fj^  -  Aj/o^  =  -2. 
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Figure  13.  Power  curves  of  three  tests  based  on  all  parameters  known  (solid), 
unknown  intercepts  only  (dashed),  and  unknown  intercepts  and 
covariance  matrix  of  the  random  errors  (dotted)  for  the  case  in 
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